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Abstract — Now-a-days online shopping have become a 
popular shopping method ever since the internet has declared a 
takeover. There are many individuals that are looking for other 
trendy shopping and online shipping is just the fix for that. This 
is the reason why online stores are a grooming business today. 
Online shopping includes buying clothes, gadgets, shoes, 
appliance, or even every day groceries. Online shopping is a way 
of best transaction between money and goods which is done by 
end user without spending a huge time. Every product on online 
shopping website is associated with reviews which represents 
quality of that specific product. For every purchasing the 
consumers are purchasing the product online by reading the 
product review. But reading all these customer reviews before 
buying product, consumes more time. Hence to overcome from 
this issue we propose opinion Mining algorithm and semantic 
analysis technique. But major issues arise when there is 
assignment of fake review given by unidentified user. So this 
system will provide methodology which will allow only those 
customers to give review who have purchased product from that 
website. Others users are not allowed to give review. This will 
decrease the wrong reviewing of product and customer will get 
reliable product. 

Index Terms — Opinion Mining Algorithm, Fake Reviews, 
Sentimental keyword analysis. 


I. Introduction 

A huge number of product reviews are springing up on the 
Internet. From these reviews, customers can obtain first-hand 
assessments of product information and supervision of their 
purchase actions. Meanwhile, manufacturers can obtain 
immediate feedback and opportunities to improve the quality 
of their products in a latest fashion. Thus, mining opinions 
from online reviews has become an increasing urgent activity 
and has attracted a great deal. Customer shopping was a 
concept in which a customer used to buy a product from a mall 
or from shop. And customers were paying money to supplier 
at the time of shopping (purchasing). For traditional 
shopping, customer needs to be directly present at shopping 
place. Also there was no customer review system available to 
describe the quality of product. Customer used to buy product 
on the basis of retailers opinion or suggestion. Sometimes lots 
of retailers give fake feedback to sell their product. But 
Now-a-days internet has done massive amount of evolution in 
shopping. Every activity is getting associated with internet. 

While online shopping customer can buy their needed 
products by sitting at home and using smart phones, laptops, 
computers etc. Here user is doing online payment by means of 
credit card or net banking systems. There is no need of 
customer to be physically gone to shop or at mall for 
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purchasing product and paying money. For choosing good 
quality products, online shopping provides review of each and 
every product given by various purchased customers. 
Normally all customer refers these reviews before buying any 
product. But customers needs more time to read each and 
every review of the particular product given by the other 
purchased customers and then take decision for purchasing 
product or not. As some reviews are maybe positive (good) 
and negative (bad) so customer has to examine each and every 
review before choosing that product. So we have proposed a 
new algorithm to guide customer for choosing a best one 
products. Here we are going to shortlist the positive reviews 
of particular product by using opinion mining algorithm. 

Another problem arises when there is allocation of false 
review to any product. For example, if one mobile phone is 
available for selling on two different major e shopping 
website like X and Z. The Z website can give fake negative 
(bad) feedback (Review) to the phone selling at website X due 
to which purchaser will reject that phone although it is having 
good quality Specifications. To avoid this problem we are 
going to design the mechanism which will accept the review 
only from those customers who really have brought that 
product this processed based on customer purchase bill 
number. This will minimize fake reviewing of products done 
by challengers. 

To overcome from the above problem we implement the 
opinion mining algorithm. Opinion mining is the algorithm of 
determining the approach of the customer with respect to the 
product. In general opinion of the user is most important for 
all organization or to individuals to improve the performance 
of the service. So opinion mining is the algorithm to extract 
(mining) the information about particular things based upon 
the customer reviews. The opinion mining is very interesting 
and important area of research due to the rising web 
technology. The machine learning is used to classify the user 
opinion text. In the section there are different types of 
machine learning techniques. They are opinion mining and 
sentimental keyword analysis. 


| This phone has a colorful and big screen | 

This phone has a colorful and big screen 


II. BACKGROUND AND RELATED WORK 

A. Opinion Mining of Movie Review using Hybrid Method 
of Support Vector Machine and Particle Swarm 
Optimization: 
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Day-to-day, online social media is online discourse where 
people contribute to create content, share it, bookmark it, and 
network at an impressive rate. The faster communication and 
ease of use in social media today is Twitter. The messages on 
Twitter include reviews and opinions on certain topic such as 
movie, book, product, politic, and so on. Based on this 
situation, this research attempts to use the messages of twitter 
to review a movie by using opinion mining or sentiment 
keyword analysis. Opinion mining refers to the application of 
natural language processing, computational linguistics, and 
text mining to classify whether the film is good or not based 
on message opinion. Support Vector Machine (SVM) is 
manage the learning methods that examine data and recognize 
the patterns that are used for classification. This research 
concern on binary classification which is classified into two 
classes. Those classes are positive and negative. The positive 
(+ve) class shows good message opinion; otherwise the 
negative class shows the bad message opinion of certain films. 
This justification is based on the accuracy level of SVM with 
the validation process uses 10-Fold cross verification and 
confusion matrix. The hybrid Partial Swarm Optimization 
(PSO) is used to improve the election of best parameter in 
order to resolve the dual optimization problem. The result 
shows the improvement of accuracy level from 71.87% to 
77%. 

B. Web product ranking using opinion mining: 

Online shopping is becoming increasingly important as more 
manufacturers sell products on the online shopping, and many 
users are using the Internet to communicate and share their 
opinions. However, it is impossible and difficult for 
consumers to read all product reviews. Therefore, it is 
essential to design effective systems to review the pros and 
cons of product characteristics, so that user can quickly find 
their favorable products. In this project, we present a product 
ranking system using opinion mining Algorithm. Users can 
specify product features to view the ranking results of all 
matched products. In this system, we consider three issues 
while analysis product scores: 1) product reviews, 2) product 
popularity, and 3) product release month. Finally, the 
experimental results execute that the system is practical and 
the ranking results are interesting. 

C. Opinion Mining Using Frequent Pattern Growth 

Method from Unstructured Text: 

In the last decade, the area of opinion mining has 
experienced a major expansion because of the increase in 
online unstructured data which are contributed by reviewers 
over various different topics and subjects. These data 
sometimes become important and need for users who want to 
take their decision based on user opinions of actual customer 
of the product. In this paper, we present the FP-growth 
method is used for frequent pattern mining from review 
documents which act as a spine for mining the opinion words 
along with their applicable features by experimental data over 
from two different domains which are very different in their 
nature. 

D. Opinion Mining on Social Media Data: 

Micro blogging (Social Media) has become a very popular 
communication tool among online users in recent years. 


Information is generated and managed through either via 
computer or mobile devices by single person and is consumed 
by many other persons, with most of this customer-generated 
content being textual information. However this trouble is 
challenging because a micro-blog post is usually very short 
and colloquial and oldest opinion mining algorithms do not 
perform well. Therefore, in this paper, we propose a new 
system architecture that can mechanically analyze the 
sentiments of these messages. We combine this system with 
manually annotated data from Social media, for the task of 
sentiment analysis. In this system, machines can learn how to 
automatically extract the set of customer messages (data) 
which contain opinions, filter out non opinion messages and 
conclude their sentiment. Experimental results confirm the 
effectiveness of our system on sentiment analysis in real 
micro blogging applications. 


III. Algorithms 
TEXT MINING ALGORITHM 

Text mining is the study of data contained in natural language 
text. The application of text mining techniques to solve 
business difficulty is called text analytics. Text mining can 
help an organization to derive potentially valuable business 
insight from text-based content such as word documents, 
email and postings on social media streams like FB, Twitter 
and Linkedln. Mining unstructured data through natural 
language processing (NLP), statistical modeling (SM) and 
machine learning techniques (MLT) can be challenging, 
because natural language text is often inconsistent. It contains 
ambiguity caused by contradictory syntax and semantics, 
including slang, language specific to vertical industries and 
age groups, double entendres and irony. 

High-quality data is typically derived through the devising 
of patterns and trends through means statistical pattern 
learning. Text mining frequently involves the process of 
framing the input text (usually parsing, along with the totaling 
of some derived linguistic features and the removal of others, 
and subsequent insertion into a DB), deriving patterns inside 
the structured data, and lastly calculating and interpretation of 
the output. 'High quality' in text mining usually refers to some 
particular combination of relevance and interestingness. 
Typical text mining tasks include text 
clustering, concept/entify extraction, production of granular 
taxonomies, sentimental keyword analysis, document 
summarization, and entity relation modeling. 

OPINION MINING ALGORITHM 

Opinion mining is a kind of natural language processing for 
tracking the feel of the public about a particular 
product. Opinion mining, which is too called sentiment 
analysis, involves structure a system to collect and categorize 
opinions regarding a product. Automated opinion mining 
frequently uses machine, a type of artificial intelligence (AI), 
to mine text for sentiment. Opinion mining can be helpful in 
several ways. It can help marketers estimate the success of an 
advertisement campaign or new product launch, determine 
which version of a product or service are popular and identify 
which demographics like or Unlike particular product 
features. For example, a review on a website (online) might be 
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broadly positive about a digital camera, but be particularly 
negative about how trouble it is. Being able to identify this 
kind of information in a systematic way give the vendor a 
much clearer picture of public opinion(suggestions) than 
surveys or focus groups do, because the data is created by the 
customer. 

There are some challenges in opinion mining. The first word 
that is consider to be positive (+ve) in one circumstances it 
may be consider as a negative (-ve) in some another situation. 
Take the word "long" for instance. If a purchaser said 
a laptop's battery life-time was long, that would be a positive 
(+) opinion. If the purchaser said that the laptop's start-up 
time was too long, however, that would be is a negative (-) 
opinion. These differences mean that an opinion mining 
system trained to gather opinions on some type of product 
(goods) or product feature may not perform very well on 
another. 

Opinion mining is a subtopic of information (data) retrieval 
with considerable research done. Several methods exist to 
find out an author’s view on a topic from natural language 
(NL) textual information. These generally employe some 
form of machine learning approach, and have unreliable 
degrees of effectiveness. 

SENTIMENT ANALYSIS 

It is also known as opinion mining refers to the use of natural 
language processing (NLP), text analysis and computational 
linguistics to identify and extract (mining) subjective 
information in source materials. Sentiment analysis is widely 
applied to reviews, suggestion and social media for a variety 
of applications, ranging from marketing to customer service. 

Classification 

There are two major forms of data analysis that can be used 
for extracting models describing important classes or to 
predict future data trends. These two major forms are as 
follows 

• Classification 

• Prediction 

Classification models predict categorical class and prediction 
models predict continuous valued 

functions. For example, we can build classification model to 
categorize bank loan applications as 

safe or risky, or a prediction models to predict the 
expenditures in dollars of prospective customers on computer 
equipment given their income and occupation. Following are 
the examples of cases where the data analysis (DA) task is 
Classification. 

Examples 

• A bank loan officer wants to analyze the data in order to 
know which purchaser (loan applicant) is risky or which are 
safe. 

• A marketing manager at a company needs to analyze a 
purchaser with a given profile, who will buy a new computer. 

• In both of the above example, a model or classifier is 
constructed to forecast the categorical labels. These labels are 
unsafe or safe for loan application data and sure or not for 
marketing data. 



IV. CONCLUSION 

The concept of this paper is to determine the customer 
reviews of mobile phones at aspect level. System performs the 
Opinion mining on the given reviews and the feature wise 
summarized results generated by the system will be useful for 
the user in taking the decision .Experimental results indicate 
that the ‘opinion mining algorithm’ perform well and has 
achieved the accuracy of 93. 2% .Opinion mining algorithm is 
necessary because nowadays everyone is busy and they don’t 
have a time to read all the positive or negative reviews if 
someone just wants to know about some feature of the 
product. Opinion mining has proved to be helpful in these 
situations as compared to simple opinion mining. 

In future work, these efforts would be done to make some 
enhancements in this technique in such a way that it can 
identify the repeated reviews and classify those user reviews 
only once. 
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